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In the present paper, we study the optimal execution problem under 
stochastic price recovery based on limit order book dynamics. We model 
price recovery after execution of a large order by accelerating the arrival 
of the refilling order, which is defined as a Cox process whose intensity 
increases by the degree of the market impact. We include not only the 
market order but also the limit order in our strategy in a restricted fashion. 

We formulate the problem as a combined stochastic control problem over 
a finite time horizon. The corresponding Hamilton-Jacobi-Bellman quasi- 
variational inequality is solved numerically. The optimal strategy obtained 
consists of three components: (i) the initial large trade; (ii) the unscheduled 
small trades during the period; (iii) the terminal large trade. The size and 
timing of the trade is governed by the tolerance for market impact depending 
on the state at each time step, and hence the strategy behaves dynamically. 

We also provide competitive results due to inclusion of the limit order, 
even though a limit order is allowed under conservative evaluation of the 
execution price. 

Keywords: Optimal execution; Market impact; Limit order book; Price recovery; 

Hamilton-Jacobi-Bellman quasi-variational inequality 


1 Introduction 

The optimal execution problem, which seeks the optimal way to liquidate or acquire a 
large asset position, is a matter of concern for both market practitioners and academic 
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researchers. The pioneering work on the optimal execution problem, Bertsimas and Lo 
IBL981 , provides an intuitive optimal strategy to minimize the purchasing cost under 
a discrete time grid and linear market impact, specifically dividing the amount of the 
target asset equally across the time grid. In seminal papers by Almgren and Chriss 
lACOll and Almgren IIAlm03l , the performance criterion of the execution consists not 
only of the expected revenue but also a penalty for the uncertainty of the revenue. In 
these papers, the market impact is defined by two components: the temporary impact 
and the permanent impact. The former work considers a discrete time model and the 
latter a continuous time model. The approach using market impact consists of tempo¬ 
rary and permanent components in continuous time with various risk-return criteria 
appears in papers such as Schied and Schoneborn IISS09t , Forsyth BForlll , Gatheral and 
Schied nGSllI , Kato BKatlTl and Brigo and Graziano HBD14I . 

A recent trend in the study of the optimal execution problem is to include features 
of the limit order book (LOB). The shape of the LOB, or how the limit orders stack 
in the LOB, is a crucial aspect for modeling the market impact. The resilience of 
the LOB, which leads the price recovery found after the execution of large orders, is 
also a significant feature of the LOB. For studies of the empirical features of the LOB 
and modelling of the dynamics related to the LOB, we refer the reader to Almgren 
lAlmOSi , Large HLarO/l , Bouchaud et. al. ||BFL09i , Hall and Hautsch IIHH07L Muke 
and Farmer I1MF08I , Toke UTokllL Gould et. al. IIGPW'''13II and Gont et. al. lIGSTlOl . 
Research reflecting LOB information in optimal trade execution is addressed in Alfonsi 
and Schied IIASIOL Predoiu et. al. IIPSSlll . Kharroubi and Pham HKPIOI . Guilbaud et. 
al. IGMP13I , and Obizhaeva and Wang IIOW13I . In particular, Obizhaeva and Wang 
IOW13I study linear market impact and exponential-type deterministic price recovery, 
finding an optimal strategy consisting of the initial large trade, pre-scheduled small 
trades during the period, and the terminal large trade. 

In the present paper, we study the optimal execution problem under stochastic price 
recovery based on LOB dynamics. From the viewpoint of the LOB, the price recovery 
after execution of a large order is regarded as refilling the stacked limit order at the 
prior price. We model price recovery by accelerating the arrival of the refilling order. 
Following research on dynamics of the LOB such as HGSTIOI and IILar07l . we adopt the 
Gox process model to describe the arrival of the refilling order. The intensity of the Gox 
process is increased by the degree of the market impact. We define two types of intensity 
functions, linear and exponential, and call the former a weak recovery intensity and the 
latter a strong recovery intensity. 

The price recovery possibly induces a waiting time to ease the market impact. To 
desterilize this waiting time, we include not only the market order but also the limit 
order in our execution strategy. In previous research dealing with the optimal execution 
problem, the limit order is rarely included in the optimal execution strategy as it is an 
intractable order method compared with the market order. Moreover, to the best of 
our knowledge, the dynamics of the market impact due to a large limit order remains 
uncertain. To avoid the complexity due to limit orders, including the market impact, 
we allow a restriction on the limit order. Our restricted limit order strategy is regarded 
as a quasi-iceberg strategy. We again use a Gox process model to describe the dynamics 
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of the counterpart market order arrivals, which turns into a Poisson process under our 
restrictions. 

The first main result of the present study is to provide a dynamic execution strategy 
depending on the state at each time step. Our new price recovery model, which is 
defined as a stochastic process, is the source of the dynamic behavior in the strategy. 
The key to understanding our strategy is tolerance for the market impact depending 
on the time remaining and the current state. The tolerance governs both the size and 
the timing of the trade. A quantitative demonstration of the usefulness of including the 
limit order in the strategy is the second main result. The proposed optimal strategy gives 
competitive results in numerical simulations, even though the limit order is allowed 
under conservative evaluation of the execution price. 

This paper is organized as follows. Section 2 introduces the mathematical formu¬ 
lation of our execution problem. We use the combined stochastic control framework, 
and the market and limit order strategies are formulated by the impulse and regular 
stochastic controls respectively. The goal of this section is to derive the correspond¬ 
ing Hamilton-Jacobi-Bellman quasi-variational inequality (HJBQVI). In Section 3, we 
present a numerical method for solving the HJBQVI, following a similar procedure to 
that found in IIIedl3l . The HJBQVI is discretized by a finite difference scheme and is 
converted to an equivalent fixed point problem. In Section 4, we present numerical 
results under several conditions, specifically the types of recovery intensity functions 
and either including or excluding the limit order. 


2 Model formulation 

We consider the selling order execution problem: we must sell all share holdings before 
the terminal time. Let (Q,;F,{;Ff}f>o,P) be a filtered probability space, Xq be an initial 
number of share holdings and let T be the terminal time. We define our execution 
problem as the combined stochastic control problem over the finite time horizon T. We 
note that we call the number of remaining share holdings as the inventory. 

2.1 Execution Strategy 

We first formulate the market and limit order strategies using impulse and stochastic 
control strategies respectively. We note that we cannot use the both limit and market 
orders at the same time in this framework. 

The market order strategy consists of the timing of each execution and the volume 
of each order. Let {Tj}j>o be a sequence of ^J-stopping times, that is, the sequence of 
execution times of market orders, and let C; be the market order volume at time Zj 
defined as an -measurable JZ^y-valued random variable, where Zj c R'''\{0}. We 
prohibit short selling, that is, Z,j is bounded by the inventory at time Ty. We denote 
V = [zj, C;};> 0 / arid note that v satisfies the definition of impulse control strategy. 

We allow a limit order under the following restrictions: (i) the order volume is small 
enough; (ii) the execution price of the limit order is evaluated as a price that is slightly 
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higher (by the fixed spread size s) than the best bid price, regardless of the quoted price 
(see equation Q in Section [Z3] |. Then our limit order strategy consists of a single factor, 
the limit order volume. We denote the control set by IL c R'*', which is the set of values 
we are allowed to choose as the limit order volume. We describe the arrival of the 
counterpart order to our limit order by a Poisson process (see, for example, ncsTinQ . 
We denote by k e IL, and the limit order volume at time t, the Poisson process 
describing the arrival of the counterpart order and the intensity of respectively. 

2.2 Price process and market impact 

We next define processes related to the asset price. Let Pt be the unaffected part of the 
asset price process for the market sell order at time f, and suppose that Pt follows the 
stochastic differential equation 

(dPt = oit,Pt)dWt, 

|Po = Po e R+. 

We note that Pt is the best bid price, and is not the mid-price. We denote by the 
degree of market impact at time t, which is governed by 

dEf = -b'B.dNf, Tj <t < Tj+i, 

+r(Cy,i), / = 0,1,2,---, (1) 

/+i 

= 0 

where is the Cox process with intensity A“(S“). The market order increases the 
market impact through the function T, and the impact recovers following the process 
N^. 

The process is the focus of the present research. Previous studies describe the 
dynamics of Et with a deterministic function. In contrast, our new model describes 
the dynamics more precisely in the sense that it includes features of the LOB in the 
dynamics. Price recovery is regarded as a random event corresponding to the arrival 
of the limit order in the LOB. This price recovery model is based on the view that (i) 
a large market order consumes the stacked limit order; (ii) the consumed orders at the 
prior price are refilled by new limit orders; (iii) the arrival of the limit order refilling the 
consumed order is more frequent than that of the usual limit order. The frequent arrival 
of the limit order is realized by increasing the intensity A“ in our model. Empirical 
studies such as ||Lar07|| take a similar approach. 

With the perspective above, we define A“(^) to be an increasing function of and 
propose the following two candidates: a strong recovery intensity defined by the expo¬ 
nential function 

A“(^) = Ai(e^2«-l), (2) 

^ In the literature, the arrival is modelled by a Cox process whose intensity depends on the spread size. 
Our situation coincides with a fixed spread size and thus the intensity is a fixed value. Hence arrivals 
are indeed described by a Poisson process. 
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( 3 ) 


and a weak recovery intensity defined by the linear function 

A^(^) = Ai^. 

2.3 Inventory and cash holdings 

Let be the inventory, the number of remaining shares, at time t. Then is governed 
by 


dXf = -ItdN^, Tj<t< Tj+i, 


-^W _ 

r/+i t7 


;+i 


C;+l/ i - 0,1,2, • • • , 


(4) 


X- = xo. 


where is a Poisson process describing the arrival of the counterpart order to our limit 
order. 

We denote by the cash holdings at time t. The dynamics of are described by 

dYf = ItiPt - Sf + s)dN^, Tj <t< Ty+i, 

-n%=n‘'- ; = 0,1,2,---, (5) 

j+i 

Y-o = 0 
wo 

We remark that the profit from sell order execution is evaluated conservatively As 
shown in the second line of equation (j^, the execution price of the market order is 
assessed as the lowest price affected by the market order. The execution price of the 
limit order is appraised slightly higher than the best bid price Pt + Sf, which is regarded 
as the minimum price that can be quoted. Hence the advantage of the limit order under 
the current setting is primarily avoidance of the market impact. 

2.4 HJBQVI 

We define Jf, the performance criterion at time f, as 

J^ix, y, p, ^) = E [giXj, Yt, Pt, Sr)| Xt = x,Yt = y, Pt = p, Hf = ^], 

where w = (Z, r;) is the combined stochastic control strategy and g : IR"'' x Rx Rx R'*' —> R. 
We remark that our performance criterion only depends on the terminal state. The value 
function Vt is defined by 

Vt{x, y, p, ^) := sup/f’(x, y, p, E,), 


and the corresponding HJBQVI is given by (see e.g. H0SO7II 1 
idtVtix, y,Tp, E) + sup {if'Vt(x, i/,p, ^)}, 


max 


teL 








with terminal condition Vjix, y,p, <^) = g{x, y,p, ^), where is the infinitesimal gener¬ 
ator of the multi-dimensional stochastic process (Xt, Yt,Pt, ^t) 


^%(x,y,p,^) = *^-^^djVtix,y,p,^) 

+ [Vt{x -l,y + ip - ^ + s)l, p, 1) - Vt{x, y, p, C)] 

+ A^(^) [Vt{x, y,p,^- 6e) - Vt{x, y, p, ^)], (7) 

is the intensity of N^, and is the operator describing the intervention: 

^^Vt{x, y,p,^)-=Vt{x-(„y + \p-^- r(C) Kp,^- r(C)). (8) 


3 Numerical method 

3.1 Terminal wealth criterion and reduced form of the value function 

To solve the HJBQVI numerically, we first specify the form of the function g. Following 
empirical studies of market impact such as BAlmOSI , we use a power law function to 
describe the market impact: 

r(v) = ( 9 ) 

where 0i, 02 ^ We define the performance criterion as the terminal wealth including 

the terminal execution. The function g is determined by 

g{x, y,p,^) = y + {p-^- r(v))x. ( 10 ) 

The second term of the above equation represents the terminal execution. 

In the current setting, we are able to remove the arguments y and p from the value 
function by substituting the ansatz Vt{x, y, p, = y+x{p-^)+(ptix.^), so that the HJBQVI 
(|^ becomes 

maxidt(pt{x ,+ sup {A^ \(pt{x (ptix,+ /s]} 

\ telL 

+ A^(^) [ptix, ^ - 6h) - + x6s], (11) 

sup Utix - C .^ + r(C)) - xTiQ - (Ptix, .^)) = 0 

CeZ(x) ) 

with terminal condition (prix, ^) = -xT(x). 
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3.2 Discretization of HJBQVI 

We discretize the HJBQVI ( [TT| ) using a finite difference scheme with grid size {bt, bx, h)-- 


max 


+ sup [(ptkixix - I ^k) - (phixix, ^k) + Is]] 

/g1L,s 


+ 


sup 


[(ptkixi,, - 6 h ) - + Xi^bs], 

- C, + T{Q) - x,J{Q - (Ptixi,, 


( 12 ) 


0 , 


where k,ix,h ^ IN/ 4 = kbt, Xi^ = ixbx, = i^b^, and Lg and Z. 5 {x) are the discretized 
sets of IL and Zix). The discretized terminal condition is represented by 

= -XiT{xiJ, 

where the other grid numbers are denoted by ^ and We have 

assumed that the grid sizes bt, bx and 6^ are chosen so that the grid numbers and 

are natural numbers. 

We define the vector (• J hy cb^. .. = (btAxu, ^h), from which we 

obtain the HJBQVI ([T^ in matrix form: 


max 


(f)k+l _ (hk 


QeZ, 


where f/ = [if = [and if = [^1- 




bt 

+ 


(13) 


We now introduce the constant h e R"*" satisfying 

/2>1 + 2(a%(vo)) + A"), 

and convert the HJBQVI ([T^ into the equivalent fixed point problem 


(p^ = max 


sup 

/eLf, 


[V(p’^ + f'^], sup [M^(p’^ + K^] 

C^-36 


(14) 
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where 



i = i = (ix, k) 



rh ' 

1 

II 

II 



~h' 

1 

II 

II 

0, 

otherwise. 


L'.. = 


^ij ~ \i={ix,k)>i=iix-ii,k+il)]' 

K^. 


{ix,k) 


x^AQ- 


We find that, thanks to the factor h, V is a contraction map displayed in matrix form, 
and is a non-expansive map. We refer to IlledlSi Section 3.2] for the procedure to 
solve the problem in equation (141. 


4 Numerical results 

In this section we investigate features of the optimal strategy obtained by the method 
described in the previous section. We divide this section into four subsections depend¬ 
ing on the type of recovery intensity and the restrictions on limit orders. Throughout 
this section, we choose the parameters to obtain the optimal strategy as in Table 


Parameter 

Description 

Value 

^0 

initial inventory 

50 

<5x 

minimum trading volume 

1 


(grid size for Xf) 


6f 

size of time step 

0.001 

<5e 

grid size for Hf 

1 

s 

spread size 

1 

01 

amplitude of market impact 

2 

02 

exponent of market impact 

1 

Ai 

amplitude of price recovery 

1 

A 2 

exponent of price recovery 

1 


Table 1: Parameters related to execution strategy 

The key to understanding the optimal behavior is that the tolerance for the market 
impact depends on the time remaining and inventory. We are able to capture the intuitive 
ideas that (i) the incentive for quick liquidation using the market order becomes higher 
over the course of time, especially near the terminal time; (ii) a lower inventory and 
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a larger market impact induce a wait for price to recover; and (iii) if the limit order is 
available, it is better to quote the limit order as compared with waiting for the price to 
recover. 

4.1 Strong recovery intensity without iimit order 

We first consider the case with strong price recovery intensity as defined in equation 
and set T = 10, = 0 and Zmax = 0. Note that we are not allowed to use the limit 

order strategy under the current setting. 

Figure shows the optimal strategy, where each panel displays a snapshot at the 
time indicated on the top of the panel. A red dot indicates that waiting for the price to 
recover is optimal, while a blue triangle indicates that selling one share, the minimum 
trading volume, by a market order is optimal. Because we prohibit short selling, the 
maximum value of Sf is g{xo - Xt). Hence, the optimal strategy is calculated in the 
triangular regions shown in Figure We observe that the region filled by red dots is 
dominant except in the last panel, which describes snapshot point close to the terminal 
time. We also note that the blue triangles appear in the region where Et takes a small 
value. These results imply that if we observe the large market impact and enough time 
remains, then it is optimal to take no action because the price is expected to recover. 
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Figure 1: The optimal strategy under strong recover intensity without limit order when 
T = 10. Each panel displays a snapshot at the time indicated above the panel. 
A red dot indicates that waiting for the price to recover is optimal, while a 
blue triangle indicates that selling one share, the minimum trading volume, is 
optimal. 

We next find that the region filled by blue triangles expands over the course of time. 
However, Figure does not have enough resolution to discuss the time evolution of 
the optimal strategy. Therefore, we focus on the region where Hf takes a lower value 
as shown in Figure We find that the region filled by blue triangles expands from the 
bottom right corner to the top left corner over the course of time. We also observe that 
selling one share by a market order is optimal when Sf = 0, that is, the market impact 
fully recovers. 
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Figure 2: The optimal strategy, focusing on the region Hf < 10 under strong recovery 
intensity without limit order when T = 10. A red dot indicates that waiting 
for the price to recover is optimal, while a blue triangle indicates that selling 
one share, the minimum trading volume, is optimal. 


The key to understanding the dynamics described in Figure is tolerance for the 
market impact. If the tolerance is high, the optimal strategy is to wait the recovery of 
the market impact, whereas if the tolerance is low, it is optimal to liquidate by market 
orders. The region filled by red dots (respectively blue triangles) in Figure [^is regarded 
as the high (resp. low) tolerance region. The manner of expanding the low tolerance 
region argues that tolerance for the market impact decreases with an increase in the 
inventory and over the course of time. This view is quite consistent with our intuition. 

The interpretation using tolerance for the market impact is supported by Figure|^and 
Figure]^ These figures display sample paths followed by the numerical simulation: the 
upper and lower panels describe the time evolution of the inventory Xf and the degree 
of market impact Et respectively. We observe that sell order execution by a market order 
with minimum trading volume occurs when Hf takes a certain value. The value that 
triggers execution is Hf = 1 before t = 7, after which different behaviors can be detected. 
From Figure]^ and Figurewe find that the triggers are Sf = 0 and Hf = 2 respectively 
We are able to explain this phenomenon by considering the remaining inventory. We 
observe that Xf in Figure Intakes a lower value than in Figure |^in the region after t = 7. 
Thus the tolerance increases in the sample path shown in Figure and decreases in 
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the path shown in Figure Focusing on the lower panel in both figures, we find that 
Sf decreases more frequently in Figure than it does in The natural behavior is to 
speed up execution when price recovery is fast, and so there are frequent arrivals of 
refill orders. 


X 



Figure 3: A sample path under strong re¬ 
cover intensity without limit or¬ 
ders when T = 10. This figure 
describes the fast price recov¬ 
ery case. The upper and lower 
panels display the time evolu¬ 
tion of the inventory Xf and the 
degree of the market impact Sf 
respectively. 



Figure 4: A sample path under strong re¬ 
cover intensity without limit or¬ 
ders when T = 10. This figure 
describes the slow price recov¬ 
ery case. The upper and lower 
panels display the time evolu¬ 
tion of the inventory Xf and the 
degree of the market impact Sf 
respectively. 


From the point of view of tolerance for market impact, and the results when T = 10, 
we predict that (i) if we choose a larger value for T then the tolerance will increase; and 
(ii) as a result, the behavior tends to wait for full price recovery over the entire sample 
paths. A longer simulation gives results that support this. Figure|^displays the optimal 
strategy when T = 50 in the same maimer as in Figure The high tolerance region is 
more dominant compared with that when T = 10. The sample paths vividly describe 
that the optimal strategy is to wait for full price recovery. We choose a sample path in 
the fast price recovery case, where price recovery is found frequently, and display it in 
Figure]^ A sample case of slow price recovery is displayed in Figure]^ Both imply that 
the optimal behavior coincides with our prediction: waiting for full price recovery is 
observed over the entire sample path. 
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Figure 5: The optimal strategy focusing on the region Hf < 10 under strong recovery 
intensity without limit orders when T = 50. A red dot indicates that waiting 
for the price to recover is optimal, while a blue triangle indicates that selling 
one share, the minimum trading volume, is optimal. 
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Figure 6: A sample path under strong 
recovery intensity without limit 
orders when T = 50. This figure 
describes the fast price recov¬ 
ery case. The upper and lower 
panels display the time evolu¬ 
tion of the inventory Xf and the 
degree of the market impact Sf 
respectively. 



Figure 7: A sample path under strong 
recovery intensity without limit 
orders when T = 50. This figure 
describes the slow price recov¬ 
ery case. The upper and lower 
panels display the time evolu¬ 
tion of the inventory Xf and the 
degree of the market impact Hf, 
respectively. 


4.2 Weak recover intensity without iimit orders 

In this subsection, we show the results using the weak recovery intensity defined in 
equation again excluding limit orders. 

Let us discuss the results for T = 10. Figure describes the optimal strategy in the 
same manner as in Figure It is obvious that the triangles representing point with 
low tolerance for market impact occupy a larger region than they do in Figure This 
result indicates that tolerance for market impact is less than in the case studied in the 
previous subsection. The reason is clear: because the intensity in equation Q implies 
that refill orders arrive less frequently compared with the intensity in equation Q, we 
do not expect the price to recover. 

We also find that the time evolution of the tolerance follows the same pattern as in 
the previous subsection. Figure shows a sample path for the current setting in the 
same manner as in Figure We observe that the execution of a sell order occurs when 
Hf takes a certain value, as we found previously. Decreased tolerance is also found in 
the sample path, as the value of Hf that triggers the execution in Figure]^ is higher than 
the value in Figure]^ 


14 























































































10 . 0 - 

7.5- 
5.0 

2.5- 
0 . 0 - 

10 . 0 - 

7.5- 
5.0 

2.5- 
0 . 0 - 

10 . 0 - 

7.5 

5.0- 

2.5- 

0.0 

10 . 0 - 

7.5 

5.0- 

2.5- 

0.0 


0.001 


10 20 30 
3.000 


40 50 


10 20 30 
6.000 


40 50 


10 20 30 
9.000 


40 50 


0 10 20 30 40 50 


10 . 0 - 

7.5- 
5.0 

2.5- 
0 . 0 - 

10 . 0 - 

7.5- 
5.0 

2.5- 
0 . 0 - 

10 . 0 - 

7.5 

5.0- 

2.5- 

0.0 

10.0 

7.5 
5.0 

2.5 
0.0 


1.000 


0 10 20 30 40 50 
4.000 


0 10 20 30 40 50 
7.000 


0 10 20 30 40 50 
9.999 


10 . 0 - 

7.5- 
5.0 

2.5- 
0 . 0 - 

10 . 0 - 

7.5- 
5.0 

2.5- 
0 . 0 - 

10 . 0 -: 
7.5 : 
5.0- ; 

2.5- : 
0.0 ■ 


2.000 


0 10 20 30 40 50 

X 


0 10 20 30 40 50 
5.000 


0 10 20 30 40 50 
8.000 


0 10 20 30 40 50 


order method 
and volume 



Figure 8: The optimal strategy focusing on the region < 10 under weak recovery 
intensity without limit orders when T = 10. A red dot indicates that waiting 
for the price to recover is optimal, while a blue triangle indicates that selling 
one share, the minimum trading volume, is optimal. 
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Figure 9: A sample path under weak re¬ 
covery intensity without limit 
orders when T = 10. The up¬ 
per and lower panels display the 
time evolution of the inventory 
Xf and the degree of the market 
impact Hf, respectively 
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Figure 10: A sample path under weak re¬ 
covery intensity without limit 
orders when T = 1. The up¬ 
per and lower panels display 
the time evolution of the inven¬ 
tory Xf and the degree of the 
market impact St, respectively 


The most remarkable finding in this subsection is observed in Figure]^ where we see 
large trades close to the initial and terminal times, unlike the previous case. The large 
trades are described with high resolution in Figure]^ which shows a sample path with 
the same parameters as in Figure except that the terminal time is T = 1. These two 
large trades, however, each have a different structure. 

Let us discuss the large trade near the initial time. We observe that it is constructed 
by a sequential market order with minimum trading volume. The sequential trades 
continue until St hits the trigger value for execution in the short term. This is the result 
of a tradeoff between the following two desires: (i) reducing the inventory to a proper 
level, which we can describe using tolerance, as quick as possible; and (ii) avoiding an 
increase in the market impact St as much as possible. 

The large trade at the terminal time has different characteristics. Focusing on Figure 


initial time; and (ii) St does not increase around the terminal time. These observations 
imply that the large trade at the terminal time is due to the terminal condition in 
equation ( [l0| . Because we are not concerned with what happens after the terminal 
time, this manner of terminal execution appears to be valid. 

We summarize the features of our optimal strategy obtained from the results thus 
far. The strategy is made up of three components: (i) a large initial trade executed by 
sequential market orders with the minimum trade volume; (ii) small trades during the 
time period triggered by tolerance for the market impact; and (iii) a large terminal large 
trade due to the terminal condition. The large trades appearing around the initial and 
terminal times vanish if the execution period is long enough or if the price recovery is 
fast enough (see Section [4T|). The tolerance for market impact plays a central role in the 


10 we see that (i) the slope of Xf at the terminal time is steeper than the slope near the 
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interpretation of the optimal behavior. 

Finally, we compare our optimal strategy with those investigated in previous research 
that models the price recovery as a deterministic process, such as Obizhaeva and Wang 
IOW131 . Both strategies consist of a large initial trade, small trades during the time 
period and a large terminal trade. The significant difference between them is found in 
the small trades. As mentioned above, the trigger for execution, which depends on the 
time remaining, the inventory and the market impact, is at the heart of our strategy. 
Because we model price recovery using stochastic processes as in equation Q, our 
optimal strategy behaves dynamically. The optimal strategies from previous studies 
are static in contrast. They describe price recovery using a deterministic function where 
we know the exact price recovery ex ante and are then able to construct the execution 
strategy for the scheduled orders. Our model fits this view if we substitute Hf into 
the expectation E[Ht]. Hence, the averaged version of our strategy corresponds to the 
previous strategy. 

4.3 Enabling limit orders 

The waiting times found in our strategy investigated in the previous subsections lead 
us to include limit orders in our strategy. Quoting the limit order appears to be efficient 
compared with simply waiting for the price to recover. The parts of our model related 
to the limit order are indeed motivated by this view. Recall that we include the limit 
order with small trade volume and a conservative evaluation of the execution price. 
The approach using the limit order with small trade volume to avoid the market impact 
is similar to the iceberg strategy used by market participants in the real world. Hence 
we call our limit order strategy the quasi-iceberg strategy. In this subsection, we discuss 
how the limit order is included in the optimal strategy. Throughout this subsection we 
fix the parameters as follows: = 0.1, Zmax = 3 and T = 30. 

Figure describes the optimal strategy in a manner similar to Figure although 
using different symbols and colors. We find that the d}mamics of the market order 
part of the optimal strategy follows the same rule as it does without limit orders. We 
observe the new region occupied by the blue plus symbol which indicates that quoting 
a limit order with three shares is optimal. This behavior coincides with our expectations 
mentioned at the beginning of the present subsection. However, we still find that the 
region filled with red dots represents an optimal strategy of waiting for the price to 
recover. The execution price for the limit order is evaluated conservative and hence 
waiting is an optimal strategy when frequent price recovery is expected. 


^ For example, the NYSE offers the block reserve order which corresponds the iceberg strategy. Academic 
research about the iceberg strategy is found, for instance, in Esser and Monch IEM07I . 
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Figure 11: Optimal strategy with limit orders and intensity = 0.1 under weak re¬ 
covery intensity when T = 30. The region shown is for Sf < 10. Each panel 
displays a snapshot at the time indicated above the panel. A red dot indicates 
that waiting for the price to recover is optimal. A yellow triangle, a green 
square and a blue plus symbol indicate quoting a limit order with one, two 
or three shares is optimal, respectively. A purple square indicates that selling 
one share, the minimum traded volume, by a market order is optimal. 


We note that the regions filled with yellow triangles and green squares only appear on 
the boundary between the regions occupied by red dots, which correspond to waiting for 
the price to recover, and by purple squares, which correspond to a quote of Zmax- Because 
there is no adverse effect due to the size of the limit order in our model, there is little 
incentive to choose trading volumes other than Zmax- From the technical perspective, 
the existence of these regions is interesting because it implies that the optimal strategy 
changes smoothly from waiting to quoting a limit order with Z^ax shares. 

The tolerance for market impact continues to give a powerful interpretation in the 
current case. The optimal strategy has the following three phases. In the first phase, 
the tolerance is high and waiting is chosen as the optimal behavior. The tolerance then 
decreases and the optimal strategy changes to quoting a limit order with Z^ax shares. 
In the third phase, the tolerance hits the trigger discussed in the previous subsections, 
so immediately selling one share, the minimum trading volume, by a market order is 
optimal. 
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The strategy of waiting, however, does not appear in the investment simulation. 
Figure displays a sample path under the current settings. In the lower right panel 
of the figure, which displays the optimal strategy over the time grid, we only observe 
immediately selling of one share by a market order and quoting a limit order with three 
shares. This observation implies that the market order part of the optimal strategy does 
not allow Hf to exceed the trigger value, which changes the strategy to waiting. Figure 
[l^ also suggests that the dynamics of the market order part of the optimal strategy is 
unchanged from the discussion above. 



Figure 12: A sample path with limit orders and intensity = 0.1 under the weak 
recovery intensity when T = 30. The upper left and right panels display 
the time evolution of the inventory Xt and the degree of the market impact 
Hf respectively. The lower left panel indicates the cumulative order volume 
executed by the limit orders. The lower right panel describes the optimal 
strategy: a blue bar indicates that immediately selling one share by a market 
order is optimal; a red bar indicates that quoting a limit order with three 
shares is optimal. 

In this sample path, the quasi-iceberg strategy works well. The cumulative order 
volume executed by the limit orders is described in the lower panel of Figure We 
observe that the limit order executions occur six times and that 18 shares are liquidated. 
This liquidation due to the limit orders leads to low tolerance for the market impact 
found near the terminal time in the upper right panel in Figure]^ which indicates the 
time evolution of Hf. Another contribution of the quasi-iceberg strategy is demonstrated 
by the performance analysis discussed in the next subsection. 
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4.4 Performance analysis 

In this subsection, we present a performance analysis of our optimal strategy. The 
results in the previous subsections suggest that we are able to reduce the market impact 
by increasing the terminal time T and taking a longer time span to liquidate the target 
asset. On the other hand, a larger value for T leads to large fluctuations from the 
initial price, as mentioned in the introduction. We quantify this tradeoff through a 
performance analysis using the liquidation rate Rj defined as follows: 

Yt - (Pr - S7 - ^(Xt’))X7 
Kj = -. 

XoPo 

The numerator of the above equation is the terminal wealth including the terminal 
execution and the denominator is the wealth obtained under perfect liquidity. Therefore, 
the liquidation rate is defined as the ratio between the total wealth with and without 
market impact. We run the simulation under weak recovery intensity and evaluate the 
expectation and the standard deviation of the liquidation rate, E[R 7 ] and SD(R 7 ’), for 
each of the terminal times given in Table We use geometric Brownian motion as the 
price process, that is, o{t,p) = op, and produce 10^ sample paths using the parameters 
in Table 1^ as well as the parameters in Table 


Parameter 

Description 

Value 

T 

terminal time 

1,3, 5,10,20, 30, 50 

0 

volatility 

0.08 

Po 

initial price 

150 

6p 

grid size for Pt 

1 

- 

number of sample paths 

10^ 


Table 2: Parameters related to numerical simulation 

Let us first discuss the case that excludes the limit order. Figure]^ plots E[R 7 ] and 
SD(Rr) on the vertical and horizontal axes respectively. The points are colored with 
a lighter blue as T increases and are interpolated linearly. We observe that there is a 
monotonic increase for both E[Rr] and SD(R 7 ’). The former and one of the features of 
our strategy imply that the improvement in performance relies on reducing the market 
impact. The latter is obviously caused by price fluctuation because of the large terminal 
time. Hence, using E[R 7 ’] and SD(R'r) to quantify the tradeoff between avoidance of 
market impact and price fluctuation is somewhat primitive. The implication is obtained 
from the shape of the curve in Figure Because the curve is concave, the amount of 
change in SD(R 7 ) required to improve E[R 7 ] a certain amount becomes larger as E[R'r] 
increases. This implies that, over a long time span, liquidating the asset is not profitable. 
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Figure 13: Performance graph when limit 
orders are prohibited. The ex¬ 
pectation E[R'r] and the stan¬ 
dard deviation SD(R 7 ) are plot¬ 
ted on the vertical and horizon¬ 
tal axes respectively. The re¬ 
sults are plotted using points 
colored with a lighter blue as 
the terminal time T increases, 
and the points are interpolated 
linearly. 


Figure 14: Performance graph when limit 
orders are permitted. The ex¬ 
pectation E[R'r] and the stan¬ 
dard deviation SD(Rr) are plot¬ 
ted on the vertical and hori¬ 
zontal axes respectively. The 
dashed blue line and the solid 
red line describe the perfor¬ 
mance with and without the 
limit order, respectively. The 
blue line is the same as that dis¬ 
played in Figure [l^ 


We next consider the case where limit orders are permitted. Recall that the limit 
order in our optimal strategy replaces the strategy of waiting for the price to recover 
found in the optimal strategy without the limit order. Thus we expect an improvement 
in performance due to the limit order. Figure 14 plots E[R 7 ’] and SD(R 7 ’) in the same 
manner as in Figurel^and supports this expectation. The dashed blue line and the solid 
red line in Figure |14| represent the results with and without the limit order respectively. 
We note that the former line is the exactly same as in Figure We observe that the 
solid red line appears on the upper left side of the dashed blue line. This indicates 
that enabling limit orders leads to an improvement both in the risk and the return. 
We find that the improvement stands out when T > 10. The limit order is executed 
when the counterpart order arrives, and the number of limit order executions declines 
if the terminal time is a small value. Hence, the improvement due to the limit order 
is relatively small unless the terminal time is large enough. We also find that main 
contribution to the improvement comes from the decrease in SD(R 7 ’). We impose a 
conservative evaluation on the execution value of the limit order, which appears to 
restrict the improvement in E[Rx]. Therefore we expect further improvements in E[R'r] 
under a more sophisticated evaluation of the limit order execution. 
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5 Concluding remarks 


In this paper, we study the optimal execution problem under stochastic price recovery 
based on LOB dynamics. We model the price recovery after execution of a large order by 
accelerating the arrival of the refilling order. The arrival of the refilling order is defined 
as a Cox process whose intensity is increased by the degree of the market impact. We 
include not only the market order but also the limit order in our execution strategy albeit 
in a restricted fashion. Our restricted limit order strategy is regarded as a quasi-iceberg 
strategy. We formulate our execution problem as a combined stochastic control problem 
over a finite time horizon. The corresponding HJBQVI is solved numerically using a 
scheme similar to that proposed by leda IIIedl3l . The performance criterion is used to 
maximize the terminal wealth including the terminal execution. 

The optimal execution strategy without the limit order is made up of three compo¬ 
nents: (i) a large initial trade executed by sequential market orders with the minimum 
trading volume; (ii) unscheduled small trades during the time period; and (iii) a large 
terminal trade due to the terminal condition. The heart of our strategy is the tolerance 
for the market impact, which depends on the time remaining and inventory: if the 
tolerance is high, then the optimal strategy is to wait for the recovery of the market 
impact, while if the tolerance is low, then it is optimal to liquidate by market orders. 
The timing of the small trades, the second component of the strategy, is triggered by 
the tolerance. Since the tolerance reflects the state at the time, our optimal strategy is 
dynamic. The tolerance also governs the size of the large trades appearing around the 
initial and terminal times. These trades vanish if the execution period is long enough 
or if the price recovery is fast enough, which leads to a high tolerance situation. 

The intervals between the small trades appearing in the above strategy motivate 
including limit orders in the strategy. Enabling limit orders under the restriction that 
a limit order does not affect the price dynamics, the strategy of waiting for the price 
to recover observed when limit orders are excluded is replaced by quoting the limit 
order. Although we impose a conservative evaluation on the limit order execution, our 
performance analysis based on the liquidation rate, the ratio between the total wealth 
with and without the market impact, demonstrates that the improvement is caused by 
the inclusion of the limit order. 
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